Attributed Consistent Hashing for Heterogeneous Storage System
نویسندگان
چکیده
Cloud-scale storage system is an important building block of the cloud infrastructure. It demands the flexibility to distribute data and provide high I/O performance. Consistent hashing algorithm is widely used in large-scale parallel/distributed storage systems for the decentralized design, scalability and adaptability. It can evenly distribute data among nodes but lack efficiency in a heterogeneous environment. In this research, we propose a novel data placement algorithm, which is based on consistent hashing while making it more efficient for heterogeneous storage systems. By considering both the capacity and bandwidth attributes of nodes, our algorithm can make better use of heterogeneous devices. Our current proof-of-concept evaluations with a distributed stroage system, Sheepdog, show promising results.
منابع مشابه
CATS: Linearizability and Partition Tolerance in Scalable and Self-Organizing Key-Value Stores
Distributed key-value stores provide scalable, fault-tolerant, and selforganizing storage services, but fall short of guaranteeing linearizable consistency in partially synchronous, lossy, partitionable, and dynamic networks, when data is distributed and replicated automatically by the principle of consistent hashing. This paper introduces consistent quorums as a solution for achieving atomic c...
متن کاملTripletMail: Replicated E-Mail Storage
We describe TripletMail, a system for outsourcing e-mail backups. The typical corporate mail server that handles receiving, sending and storing e-mails is replaced by two components: A mail server for SMTP/IMAP and a storage system which maintains several encrypted copies of each e-mail in racks of inexpensive servers. In this thesis, we focus on two problems of the storage system: In inexpensi...
متن کاملMulti-probe consistent hashing
1 Abstract We describe a consistent hashing algorithm which performs multiple lookups per key in a hash table of nodes. It requires no additional storage beyond the hash table, and achieves a peak-to-average load ratio of 1 + ε with just 1 + 1 ε lookups per key.
متن کاملDistributed Metadata Management Scheme in HDFS
A Hadoop Distributed File System (HDFS) is designed to store very large data sets reliably and to stream those data sets at high bandwidth to user applications. Metadata management is critical to distributed file system. In HDFS architecture, a single master server manages all metadata, while a number of data servers store file data. This architecture can’t meet the exponentially increased stor...
متن کاملTransitive Hashing Network for Heterogeneous Multimedia Retrieval
Hashing has been widely applied to large-scale multimedia retrieval due to the storage and retrieval efficiency. Cross-modal hashing enables efficient retrieval from database of one modality in response to a query of another modality. Existing work on cross-modal hashing assumes heterogeneous relationship across modalities for hash function learning. In this paper, we relax the strong assumptio...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016